Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 68187 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.9 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 9 |
age is highly overall correlated with age_years | High correlation |
weight is highly overall correlated with bmi and 1 other fields | High correlation |
ap_hi is highly overall correlated with ap_lo and 2 other fields | High correlation |
ap_lo is highly overall correlated with ap_hi and 2 other fields | High correlation |
age_years is highly overall correlated with age | High correlation |
bmi is highly overall correlated with weight and 1 other fields | High correlation |
New_BMI is highly overall correlated with weight and 1 other fields | High correlation |
bp_category is highly overall correlated with ap_hi and 2 other fields | High correlation |
bp_category_encoded is highly overall correlated with ap_hi and 2 other fields | High correlation |
gluc is highly imbalanced (52.2%) | Imbalance |
smoke is highly imbalanced (57.1%) | Imbalance |
alco is highly imbalanced (70.0%) | Imbalance |
id is uniformly distributed | Uniform |
id has unique values | Unique |
Reproduction
| Analysis started | 2023-10-25 12:34:44.452172 |
|---|---|
| Analysis finished | 2023-10-25 12:34:51.952611 |
| Duration | 7.5 seconds |
| Software version | ydata-profiling vv4.6.0 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 68187 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49972.49 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4951.3 |
| Q1 | 24989.5 |
| median | 50009 |
| Q3 | 74879.5 |
| 95-th percentile | 94936.1 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 49890 |
Descriptive statistics
| Standard deviation | 28853.618 |
|---|---|
| Coefficient of variation (CV) | 0.57739003 |
| Kurtosis | -1.198473 |
| Mean | 49972.49 |
| Median Absolute Deviation (MAD) | 24951 |
| Skewness | -0.001482544 |
| Sum | 3.4074742 × 109 |
| Variance | 8.3253125 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 66649 | 1 | < 0.1% |
| 66625 | 1 | < 0.1% |
| 66626 | 1 | < 0.1% |
| 66628 | 1 | < 0.1% |
| 66630 | 1 | < 0.1% |
| 66631 | 1 | < 0.1% |
| 66632 | 1 | < 0.1% |
| 66633 | 1 | < 0.1% |
| 66635 | 1 | < 0.1% |
| Other values (68177) | 68177 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 |
| Value | Count | Frequency (%) |
| 99999 | 1 | |
| 99998 | 1 | |
| 99996 | 1 | |
| 99995 | 1 | |
| 99993 | 1 | |
| 99992 | 1 | |
| 99991 | 1 | |
| 99990 | 1 | |
| 99988 | 1 | |
| 99986 | 1 |
age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8060 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19462.585 |
| Minimum | 10798 |
|---|---|
| Maximum | 23713 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 10798 |
|---|---|
| 5-th percentile | 15054.3 |
| Q1 | 17656 |
| median | 19700 |
| Q3 | 21323 |
| 95-th percentile | 23256 |
| Maximum | 23713 |
| Range | 12915 |
| Interquartile range (IQR) | 3667 |
Descriptive statistics
| Standard deviation | 2468.3322 |
|---|---|
| Coefficient of variation (CV) | 0.12682448 |
| Kurtosis | -0.82604311 |
| Mean | 19462.585 |
| Median Absolute Deviation (MAD) | 1713 |
| Skewness | -0.30476348 |
| Sum | 1.3270953 × 109 |
| Variance | 6092663.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19741 | 32 | < 0.1% |
| 18253 | 31 | < 0.1% |
| 21892 | 30 | < 0.1% |
| 18236 | 30 | < 0.1% |
| 18184 | 30 | < 0.1% |
| 19733 | 29 | < 0.1% |
| 20389 | 29 | < 0.1% |
| 20376 | 29 | < 0.1% |
| 20442 | 29 | < 0.1% |
| 19770 | 28 | < 0.1% |
| Other values (8050) | 67890 |
| Value | Count | Frequency (%) |
| 10798 | 1 | < 0.1% |
| 10859 | 1 | < 0.1% |
| 10878 | 1 | < 0.1% |
| 10964 | 1 | < 0.1% |
| 14275 | 1 | < 0.1% |
| 14277 | 1 | < 0.1% |
| 14282 | 1 | < 0.1% |
| 14284 | 1 | < 0.1% |
| 14287 | 1 | < 0.1% |
| 14291 | 3 |
| Value | Count | Frequency (%) |
| 23713 | 1 | |
| 23701 | 1 | |
| 23692 | 1 | |
| 23690 | 1 | |
| 23687 | 1 | |
| 23684 | 1 | |
| 23678 | 1 | |
| 23677 | 1 | |
| 23675 | 2 | |
| 23673 | 2 |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 1 | |
|---|---|
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 44414 | |
| 2 | 23773 |
height
Real number (ℝ)
| Distinct | 105 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 164.37604 |
| Minimum | 55 |
|---|---|
| Maximum | 250 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 55 |
|---|---|
| 5-th percentile | 152 |
| Q1 | 159 |
| median | 165 |
| Q3 | 170 |
| 95-th percentile | 178 |
| Maximum | 250 |
| Range | 195 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.1716715 |
|---|---|
| Coefficient of variation (CV) | 0.049713276 |
| Kurtosis | 7.6541922 |
| Mean | 164.37604 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.60913976 |
| Sum | 11208309 |
| Variance | 66.776215 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 165 | 5728 | 8.4% |
| 160 | 4889 | 7.2% |
| 170 | 4583 | 6.7% |
| 168 | 4307 | 6.3% |
| 164 | 3323 | 4.9% |
| 158 | 3236 | 4.7% |
| 162 | 3179 | 4.7% |
| 169 | 2741 | 4.0% |
| 156 | 2681 | 3.9% |
| 167 | 2486 | 3.6% |
| Other values (95) | 31034 |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 65 | 2 | |
| 67 | 3 | |
| 68 | 2 | |
| 70 | 2 | |
| 71 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 250 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 198 | 14 | |
| 197 | 4 | < 0.1% |
| 196 | 6 | |
| 195 | 6 | |
| 194 | 2 | < 0.1% |
| 193 | 6 | |
| 192 | 12 | |
| 191 | 11 |
weight
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 267 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.112623 |
| Minimum | 35 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 65 |
| median | 72 |
| Q3 | 82 |
| 95-th percentile | 100 |
| Maximum | 200 |
| Range | 165 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.27153 |
|---|---|
| Coefficient of variation (CV) | 0.19256545 |
| Kurtosis | 2.5525326 |
| Mean | 74.112623 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.0160874 |
| Sum | 5053517.4 |
| Variance | 203.67658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 3779 | 5.5% |
| 70 | 3692 | 5.4% |
| 68 | 2767 | 4.1% |
| 75 | 2675 | 3.9% |
| 60 | 2670 | 3.9% |
| 80 | 2569 | 3.8% |
| 72 | 2249 | 3.3% |
| 69 | 2152 | 3.2% |
| 78 | 2035 | 3.0% |
| 74 | 1827 | 2.7% |
| Other values (257) | 41772 |
| Value | Count | Frequency (%) |
| 35 | 2 | < 0.1% |
| 35.45 | 1 | < 0.1% |
| 36 | 5 | < 0.1% |
| 37 | 6 | < 0.1% |
| 38 | 7 | < 0.1% |
| 39 | 9 | < 0.1% |
| 40 | 41 | |
| 41 | 34 | |
| 42 | 48 | |
| 42.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 200 | 2 | |
| 183 | 1 | < 0.1% |
| 180 | 4 | |
| 178 | 3 | |
| 177 | 1 | < 0.1% |
| 172 | 1 | < 0.1% |
| 171 | 1 | < 0.1% |
| 170 | 3 | |
| 169 | 1 | < 0.1% |
| 168 | 3 |
ap_hi
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.43746 |
| Minimum | 90 |
|---|---|
| Maximum | 180 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 90 |
|---|---|
| 5-th percentile | 100 |
| Q1 | 120 |
| median | 120 |
| Q3 | 140 |
| 95-th percentile | 160 |
| Maximum | 180 |
| Range | 90 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.961509 |
|---|---|
| Coefficient of variation (CV) | 0.12624035 |
| Kurtosis | 0.76062622 |
| Mean | 126.43746 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.73997479 |
| Sum | 8621391 |
| Variance | 254.76978 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 120 | 27649 | |
| 140 | 9323 | 13.7% |
| 130 | 8905 | 13.1% |
| 110 | 8612 | 12.6% |
| 150 | 4196 | 6.2% |
| 160 | 2792 | 4.1% |
| 100 | 2560 | 3.8% |
| 90 | 928 | 1.4% |
| 170 | 647 | 0.9% |
| 180 | 602 | 0.9% |
| Other values (76) | 1973 | 2.9% |
| Value | Count | Frequency (%) |
| 90 | 928 | 1.4% |
| 93 | 1 | < 0.1% |
| 95 | 28 | < 0.1% |
| 96 | 2 | < 0.1% |
| 99 | 4 | < 0.1% |
| 100 | 2560 | |
| 101 | 4 | < 0.1% |
| 102 | 8 | < 0.1% |
| 103 | 8 | < 0.1% |
| 104 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 180 | 602 | |
| 179 | 4 | < 0.1% |
| 178 | 2 | < 0.1% |
| 177 | 2 | < 0.1% |
| 176 | 3 | < 0.1% |
| 175 | 14 | < 0.1% |
| 174 | 3 | < 0.1% |
| 173 | 2 | < 0.1% |
| 172 | 8 | < 0.1% |
| 171 | 8 | < 0.1% |
ap_lo
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.265564 |
| Minimum | 60 |
|---|---|
| Maximum | 120 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 70 |
| Q1 | 80 |
| median | 80 |
| Q3 | 90 |
| 95-th percentile | 100 |
| Maximum | 120 |
| Range | 60 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.1433353 |
|---|---|
| Coefficient of variation (CV) | 0.11251181 |
| Kurtosis | 0.9324608 |
| Mean | 81.265564 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.23905336 |
| Sum | 5541255 |
| Variance | 83.60058 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 34719 | |
| 90 | 14236 | |
| 70 | 10206 | 15.0% |
| 100 | 3978 | 5.8% |
| 60 | 2654 | 3.9% |
| 79 | 357 | 0.5% |
| 110 | 338 | 0.5% |
| 85 | 290 | 0.4% |
| 75 | 209 | 0.3% |
| 95 | 158 | 0.2% |
| Other values (48) | 1042 | 1.5% |
| Value | Count | Frequency (%) |
| 60 | 2654 | |
| 61 | 5 | < 0.1% |
| 62 | 7 | < 0.1% |
| 63 | 7 | < 0.1% |
| 64 | 10 | < 0.1% |
| 65 | 78 | 0.1% |
| 66 | 11 | < 0.1% |
| 67 | 19 | < 0.1% |
| 68 | 13 | < 0.1% |
| 69 | 98 | 0.1% |
| Value | Count | Frequency (%) |
| 120 | 134 | 0.2% |
| 119 | 2 | < 0.1% |
| 115 | 7 | < 0.1% |
| 114 | 1 | < 0.1% |
| 113 | 3 | < 0.1% |
| 112 | 1 | < 0.1% |
| 111 | 1 | < 0.1% |
| 110 | 338 | |
| 109 | 6 | < 0.1% |
| 108 | 3 | < 0.1% |
cholesterol
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 51209 | |
| 2 | 9187 | 13.5% |
| 3 | 7791 | 11.4% |
gluc
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 1 | |
|---|---|
| 3 | 5179 |
| 2 | 4996 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 58012 | |
| 3 | 5179 | 7.6% |
| 2 | 4996 | 7.3% |
smoke
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 0 | |
|---|---|
| 1 | 5978 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 62209 | |
| 1 | 5978 | 8.8% |
alco
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 0 | |
|---|---|
| 1 | 3623 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 64564 | |
| 1 | 3623 | 5.3% |
active
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 54790 | |
| 0 | 13397 | 19.6% |
cardio
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68187 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68187 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68187 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 34522 | |
| 1 | 33665 |
age_years
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.823397 |
| Minimum | 29 |
|---|---|
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 48 |
| median | 53 |
| Q3 | 58 |
| 95-th percentile | 63 |
| Maximum | 64 |
| Range | 35 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.7697955 |
|---|---|
| Coefficient of variation (CV) | 0.12815903 |
| Kurtosis | -0.8213646 |
| Mean | 52.823397 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.30352224 |
| Sum | 3601869 |
| Variance | 45.830132 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55 | 3824 | 5.6% |
| 53 | 3751 | 5.5% |
| 57 | 3567 | 5.2% |
| 54 | 3528 | 5.2% |
| 56 | 3507 | 5.1% |
| 59 | 3481 | 5.1% |
| 49 | 3335 | 4.9% |
| 58 | 3311 | 4.9% |
| 51 | 3273 | 4.8% |
| 52 | 3193 | 4.7% |
| Other values (18) | 33417 |
| Value | Count | Frequency (%) |
| 29 | 3 | < 0.1% |
| 30 | 1 | < 0.1% |
| 39 | 1749 | |
| 40 | 1590 | |
| 41 | 1855 | |
| 42 | 1388 | |
| 43 | 1981 | |
| 44 | 1475 | |
| 45 | 2039 | |
| 46 | 1594 |
| Value | Count | Frequency (%) |
| 64 | 2121 | |
| 63 | 2651 | |
| 62 | 2134 | |
| 61 | 2647 | |
| 60 | 3127 | |
| 59 | 3481 | |
| 58 | 3311 | |
| 57 | 3567 | |
| 56 | 3507 | |
| 55 | 3824 |
bmi
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3739 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.514316 |
| Minimum | 12.254473 |
|---|---|
| Maximum | 298.66667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 12.254473 |
|---|---|
| 5-th percentile | 20.936639 |
| Q1 | 23.875115 |
| median | 26.346494 |
| Q3 | 30.116213 |
| 95-th percentile | 37.253645 |
| Maximum | 298.66667 |
| Range | 286.41219 |
| Interquartile range (IQR) | 6.2410984 |
Descriptive statistics
| Standard deviation | 6.0223563 |
|---|---|
| Coefficient of variation (CV) | 0.21888083 |
| Kurtosis | 231.26197 |
| Mean | 27.514316 |
| Median Absolute Deviation (MAD) | 2.9229366 |
| Skewness | 7.8396139 |
| Sum | 1876118.7 |
| Variance | 36.268775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.87511478 | 930 | 1.4% |
| 23.4375 | 641 | 0.9% |
| 24.22145329 | 485 | 0.7% |
| 25.71166208 | 359 | 0.5% |
| 22.03856749 | 354 | 0.5% |
| 23.03004535 | 342 | 0.5% |
| 24.8015873 | 325 | 0.5% |
| 23.52941176 | 313 | 0.5% |
| 24.97704316 | 284 | 0.4% |
| 25.390625 | 279 | 0.4% |
| Other values (3729) | 63875 |
| Value | Count | Frequency (%) |
| 12.25447288 | 1 | |
| 12.85583104 | 1 | |
| 13.52082207 | 1 | |
| 13.76 | 1 | |
| 14.47950008 | 1 | |
| 14.52737603 | 1 | |
| 14.57725948 | 1 | |
| 14.6092038 | 1 | |
| 14.69237833 | 1 | |
| 14.70113665 | 2 |
| Value | Count | Frequency (%) |
| 298.6666667 | 1 | |
| 278.125 | 1 | |
| 267.768595 | 1 | |
| 237.7686328 | 1 | |
| 191.6666667 | 1 | |
| 187.7500769 | 1 | |
| 180.6780742 | 1 | |
| 178.9627465 | 1 | |
| 178.2134106 | 1 | |
| 170.4142012 | 1 |
bp_category
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| Hypertension Stage 1 | |
|---|---|
| Hypertension Stage 2 | |
| Normal | |
| Elevated | 3100 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.522607 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1194814 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hypertension Stage 1 |
|---|---|
| 2nd row | Hypertension Stage 2 |
| 3rd row | Hypertension Stage 1 |
| 4th row | Hypertension Stage 2 |
| 5th row | Normal |
Common Values
| Value | Count | Frequency (%) |
| Hypertension Stage 1 | 39743 | |
| Hypertension Stage 2 | 15935 | |
| Normal | 9409 | 13.8% |
| Elevated | 3100 | 4.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hypertension | 55678 | |
| stage | 55678 | |
| 1 | 39743 | |
| 2 | 15935 | 8.9% |
| normal | 9409 | 5.2% |
| elevated | 3100 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | 9.6% |
| 111356 | 9.3% | |
| n | 111356 | 9.3% |
| a | 68187 | 5.7% |
| r | 65087 | 5.4% |
| o | 65087 | 5.4% |
| H | 55678 | 4.7% |
| g | 55678 | 4.7% |
| y | 55678 | 4.7% |
| Other values (12) | 319017 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 903915 | |
| Uppercase Letter | 123865 | 10.4% |
| Space Separator | 111356 | 9.3% |
| Decimal Number | 55678 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | |
| n | 111356 | |
| a | 68187 | 7.5% |
| r | 65087 | 7.2% |
| o | 65087 | 7.2% |
| g | 55678 | 6.2% |
| y | 55678 | 6.2% |
| i | 55678 | 6.2% |
| s | 55678 | 6.2% |
| Other values (5) | 83796 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 55678 | |
| S | 55678 | |
| N | 9409 | 7.6% |
| E | 3100 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39743 | |
| 2 | 15935 |
Space Separator
| Value | Count | Frequency (%) |
| 111356 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1027780 | |
| Common | 167034 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | |
| n | 111356 | |
| a | 68187 | 6.6% |
| r | 65087 | 6.3% |
| o | 65087 | 6.3% |
| H | 55678 | 5.4% |
| g | 55678 | 5.4% |
| y | 55678 | 5.4% |
| S | 55678 | 5.4% |
| Other values (9) | 207661 |
Common
| Value | Count | Frequency (%) |
| 111356 | ||
| 1 | 39743 | 23.8% |
| 2 | 15935 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1194814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | 9.6% |
| 111356 | 9.3% | |
| n | 111356 | 9.3% |
| a | 68187 | 5.7% |
| r | 65087 | 5.4% |
| o | 65087 | 5.4% |
| H | 55678 | 4.7% |
| g | 55678 | 4.7% |
| y | 55678 | 4.7% |
| Other values (12) | 319017 |
bp_category_encoded
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| Hypertension Stage 1 | |
|---|---|
| Hypertension Stage 2 | |
| Normal | |
| Elevated | 3100 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.522607 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1194814 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hypertension Stage 1 |
|---|---|
| 2nd row | Hypertension Stage 2 |
| 3rd row | Hypertension Stage 1 |
| 4th row | Hypertension Stage 2 |
| 5th row | Normal |
Common Values
| Value | Count | Frequency (%) |
| Hypertension Stage 1 | 39743 | |
| Hypertension Stage 2 | 15935 | |
| Normal | 9409 | 13.8% |
| Elevated | 3100 | 4.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hypertension | 55678 | |
| stage | 55678 | |
| 1 | 39743 | |
| 2 | 15935 | 8.9% |
| normal | 9409 | 5.2% |
| elevated | 3100 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | 9.6% |
| 111356 | 9.3% | |
| n | 111356 | 9.3% |
| a | 68187 | 5.7% |
| r | 65087 | 5.4% |
| o | 65087 | 5.4% |
| H | 55678 | 4.7% |
| g | 55678 | 4.7% |
| y | 55678 | 4.7% |
| Other values (12) | 319017 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 903915 | |
| Uppercase Letter | 123865 | 10.4% |
| Space Separator | 111356 | 9.3% |
| Decimal Number | 55678 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | |
| n | 111356 | |
| a | 68187 | 7.5% |
| r | 65087 | 7.2% |
| o | 65087 | 7.2% |
| g | 55678 | 6.2% |
| y | 55678 | 6.2% |
| i | 55678 | 6.2% |
| s | 55678 | 6.2% |
| Other values (5) | 83796 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 55678 | |
| S | 55678 | |
| N | 9409 | 7.6% |
| E | 3100 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39743 | |
| 2 | 15935 |
Space Separator
| Value | Count | Frequency (%) |
| 111356 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1027780 | |
| Common | 167034 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | |
| n | 111356 | |
| a | 68187 | 6.6% |
| r | 65087 | 6.3% |
| o | 65087 | 6.3% |
| H | 55678 | 5.4% |
| g | 55678 | 5.4% |
| y | 55678 | 5.4% |
| S | 55678 | 5.4% |
| Other values (9) | 207661 |
Common
| Value | Count | Frequency (%) |
| 111356 | ||
| 1 | 39743 | 23.8% |
| 2 | 15935 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1194814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 173234 | |
| t | 114456 | 9.6% |
| 111356 | 9.3% | |
| n | 111356 | 9.3% |
| a | 68187 | 5.7% |
| r | 65087 | 5.4% |
| o | 65087 | 5.4% |
| H | 55678 | 4.7% |
| g | 55678 | 4.7% |
| y | 55678 | 4.7% |
| Other values (12) | 319017 |
New_BMI
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3740 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.514316 |
| Minimum | 12.254473 |
|---|---|
| Maximum | 298.66667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 12.254473 |
|---|---|
| 5-th percentile | 20.936639 |
| Q1 | 23.875115 |
| median | 26.346494 |
| Q3 | 30.116213 |
| 95-th percentile | 37.253645 |
| Maximum | 298.66667 |
| Range | 286.41219 |
| Interquartile range (IQR) | 6.2410984 |
Descriptive statistics
| Standard deviation | 6.0223563 |
|---|---|
| Coefficient of variation (CV) | 0.21888083 |
| Kurtosis | 231.26197 |
| Mean | 27.514316 |
| Median Absolute Deviation (MAD) | 2.9229366 |
| Skewness | 7.8396139 |
| Sum | 1876118.7 |
| Variance | 36.268775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.87511478 | 930 | 1.4% |
| 23.4375 | 641 | 0.9% |
| 24.22145329 | 485 | 0.7% |
| 25.71166208 | 359 | 0.5% |
| 22.03856749 | 354 | 0.5% |
| 23.03004535 | 342 | 0.5% |
| 24.8015873 | 325 | 0.5% |
| 23.52941176 | 313 | 0.5% |
| 24.97704316 | 284 | 0.4% |
| 25.390625 | 279 | 0.4% |
| Other values (3730) | 63875 |
| Value | Count | Frequency (%) |
| 12.25447288 | 1 | |
| 12.85583104 | 1 | |
| 13.52082207 | 1 | |
| 13.76 | 1 | |
| 14.47950008 | 1 | |
| 14.52737603 | 1 | |
| 14.57725948 | 1 | |
| 14.6092038 | 1 | |
| 14.69237833 | 1 | |
| 14.70113665 | 2 |
| Value | Count | Frequency (%) |
| 298.6666667 | 1 | |
| 278.125 | 1 | |
| 267.768595 | 1 | |
| 237.7686328 | 1 | |
| 191.6666667 | 1 | |
| 187.7500769 | 1 | |
| 180.6780742 | 1 | |
| 178.9627465 | 1 | |
| 178.2134106 | 1 | |
| 170.4142012 | 1 |
| id | age | height | weight | ap_hi | ap_lo | age_years | bmi | New_BMI | gender | cholesterol | gluc | smoke | alco | active | cardio | bp_category | bp_category_encoded | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1.000 | 0.003 | -0.002 | -0.002 | 0.003 | -0.001 | 0.003 | -0.001 | -0.001 | 0.013 | 0.006 | 0.000 | 0.004 | 0.000 | 0.006 | 0.006 | 0.006 | 0.006 |
| age | 0.003 | 1.000 | -0.082 | 0.062 | 0.222 | 0.157 | 0.999 | 0.108 | 0.108 | 0.052 | 0.113 | 0.071 | 0.048 | 0.028 | 0.014 | 0.240 | 0.112 | 0.112 |
| height | -0.002 | -0.082 | 1.000 | 0.314 | 0.021 | 0.031 | -0.084 | -0.183 | -0.183 | 0.415 | 0.031 | 0.012 | 0.169 | 0.089 | 0.014 | 0.017 | 0.037 | 0.037 |
| weight | -0.002 | 0.062 | 0.314 | 1.000 | 0.276 | 0.249 | 0.063 | 0.848 | 0.848 | 0.170 | 0.099 | 0.085 | 0.070 | 0.066 | 0.024 | 0.171 | 0.139 | 0.139 |
| ap_hi | 0.003 | 0.222 | 0.021 | 0.276 | 1.000 | 0.741 | 0.223 | 0.278 | 0.278 | 0.086 | 0.174 | 0.089 | 0.031 | 0.038 | 0.021 | 0.463 | 0.678 | 0.678 |
| ap_lo | -0.001 | 0.157 | 0.031 | 0.249 | 0.741 | 1.000 | 0.158 | 0.244 | 0.244 | 0.072 | 0.131 | 0.065 | 0.025 | 0.045 | 0.009 | 0.365 | 0.723 | 0.723 |
| age_years | 0.003 | 0.999 | -0.084 | 0.063 | 0.223 | 0.158 | 1.000 | 0.110 | 0.110 | 0.051 | 0.112 | 0.070 | 0.048 | 0.029 | 0.015 | 0.240 | 0.112 | 0.112 |
| bmi | -0.001 | 0.108 | -0.183 | 0.848 | 0.278 | 0.244 | 0.110 | 1.000 | 1.000 | 0.065 | 0.041 | 0.040 | 0.015 | 0.000 | 0.010 | 0.053 | 0.040 | 0.040 |
| New_BMI | -0.001 | 0.108 | -0.183 | 0.848 | 0.278 | 0.244 | 0.110 | 1.000 | 1.000 | 0.065 | 0.041 | 0.040 | 0.015 | 0.000 | 0.010 | 0.053 | 0.040 | 0.040 |
| gender | 0.013 | 0.052 | 0.415 | 0.170 | 0.086 | 0.072 | 0.051 | 0.065 | 0.065 | 1.000 | 0.037 | 0.021 | 0.338 | 0.171 | 0.003 | 0.005 | 0.080 | 0.080 |
| cholesterol | 0.006 | 0.113 | 0.031 | 0.099 | 0.174 | 0.131 | 0.112 | 0.041 | 0.041 | 0.037 | 1.000 | 0.393 | 0.024 | 0.043 | 0.012 | 0.221 | 0.122 | 0.122 |
| gluc | 0.000 | 0.071 | 0.012 | 0.085 | 0.089 | 0.065 | 0.070 | 0.040 | 0.040 | 0.021 | 0.393 | 1.000 | 0.019 | 0.029 | 0.011 | 0.091 | 0.063 | 0.063 |
| smoke | 0.004 | 0.048 | 0.169 | 0.070 | 0.031 | 0.025 | 0.048 | 0.015 | 0.015 | 0.338 | 0.024 | 0.019 | 1.000 | 0.338 | 0.025 | 0.016 | 0.020 | 0.020 |
| alco | 0.000 | 0.028 | 0.089 | 0.066 | 0.038 | 0.045 | 0.029 | 0.000 | 0.000 | 0.171 | 0.043 | 0.029 | 0.338 | 1.000 | 0.024 | 0.008 | 0.030 | 0.030 |
| active | 0.006 | 0.014 | 0.014 | 0.024 | 0.021 | 0.009 | 0.015 | 0.010 | 0.010 | 0.003 | 0.012 | 0.011 | 0.025 | 0.024 | 1.000 | 0.038 | 0.014 | 0.014 |
| cardio | 0.006 | 0.240 | 0.017 | 0.171 | 0.463 | 0.365 | 0.240 | 0.053 | 0.053 | 0.005 | 0.221 | 0.091 | 0.016 | 0.008 | 0.038 | 1.000 | 0.373 | 0.373 |
| bp_category | 0.006 | 0.112 | 0.037 | 0.139 | 0.678 | 0.723 | 0.112 | 0.040 | 0.040 | 0.080 | 0.122 | 0.063 | 0.020 | 0.030 | 0.014 | 0.373 | 1.000 | 1.000 |
| bp_category_encoded | 0.006 | 0.112 | 0.037 | 0.139 | 0.678 | 0.723 | 0.112 | 0.040 | 0.040 | 0.080 | 0.122 | 0.063 | 0.020 | 0.030 | 0.014 | 0.373 | 1.000 | 1.000 |
| id | age | gender | height | weight | ap_hi | ap_lo | cholesterol | gluc | smoke | alco | active | cardio | age_years | bmi | bp_category | bp_category_encoded | New_BMI | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 18393 | 2 | 168 | 62.0 | 110 | 80 | 1 | 1 | 0 | 0 | 1 | 0 | 50 | 21.967120 | Hypertension Stage 1 | Hypertension Stage 1 | 21.967120 |
| 1 | 1 | 20228 | 1 | 156 | 85.0 | 140 | 90 | 3 | 1 | 0 | 0 | 1 | 1 | 55 | 34.927679 | Hypertension Stage 2 | Hypertension Stage 2 | 34.927679 |
| 2 | 2 | 18857 | 1 | 165 | 64.0 | 130 | 70 | 3 | 1 | 0 | 0 | 0 | 1 | 51 | 23.507805 | Hypertension Stage 1 | Hypertension Stage 1 | 23.507805 |
| 3 | 3 | 17623 | 2 | 169 | 82.0 | 150 | 100 | 1 | 1 | 0 | 0 | 1 | 1 | 48 | 28.710479 | Hypertension Stage 2 | Hypertension Stage 2 | 28.710479 |
| 4 | 4 | 17474 | 1 | 156 | 56.0 | 100 | 60 | 1 | 1 | 0 | 0 | 0 | 0 | 47 | 23.011177 | Normal | Normal | 23.011177 |
| 5 | 8 | 21914 | 1 | 151 | 67.0 | 120 | 80 | 2 | 2 | 0 | 0 | 0 | 0 | 60 | 29.384676 | Hypertension Stage 1 | Hypertension Stage 1 | 29.384676 |
| 6 | 9 | 22113 | 1 | 157 | 93.0 | 130 | 80 | 3 | 1 | 0 | 0 | 1 | 0 | 60 | 37.729725 | Hypertension Stage 1 | Hypertension Stage 1 | 37.729725 |
| 7 | 12 | 22584 | 2 | 178 | 95.0 | 130 | 90 | 3 | 3 | 0 | 0 | 1 | 1 | 61 | 29.983588 | Hypertension Stage 1 | Hypertension Stage 1 | 29.983588 |
| 8 | 13 | 17668 | 1 | 158 | 71.0 | 110 | 70 | 1 | 1 | 0 | 0 | 1 | 0 | 48 | 28.440955 | Normal | Normal | 28.440955 |
| 9 | 14 | 19834 | 1 | 164 | 68.0 | 110 | 60 | 1 | 1 | 0 | 0 | 0 | 0 | 54 | 25.282570 | Normal | Normal | 25.282570 |
| id | age | gender | height | weight | ap_hi | ap_lo | cholesterol | gluc | smoke | alco | active | cardio | age_years | bmi | bp_category | bp_category_encoded | New_BMI | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68195 | 99986 | 15094 | 1 | 168 | 72.0 | 110 | 70 | 1 | 1 | 0 | 0 | 1 | 1 | 41 | 25.510204 | Normal | Normal | 25.510204 |
| 68196 | 99988 | 20609 | 1 | 159 | 72.0 | 130 | 90 | 2 | 2 | 0 | 0 | 1 | 0 | 56 | 28.479886 | Hypertension Stage 1 | Hypertension Stage 1 | 28.479886 |
| 68197 | 99990 | 18792 | 1 | 161 | 56.0 | 170 | 90 | 1 | 1 | 0 | 0 | 1 | 1 | 51 | 21.604105 | Hypertension Stage 2 | Hypertension Stage 2 | 21.604105 |
| 68198 | 99991 | 19699 | 1 | 172 | 70.0 | 130 | 90 | 1 | 1 | 0 | 0 | 1 | 1 | 53 | 23.661439 | Hypertension Stage 1 | Hypertension Stage 1 | 23.661439 |
| 68199 | 99992 | 21074 | 1 | 165 | 80.0 | 150 | 80 | 1 | 1 | 0 | 0 | 1 | 1 | 57 | 29.384757 | Hypertension Stage 1 | Hypertension Stage 1 | 29.384757 |
| 68200 | 99993 | 19240 | 2 | 168 | 76.0 | 120 | 80 | 1 | 1 | 1 | 0 | 1 | 0 | 52 | 26.927438 | Hypertension Stage 1 | Hypertension Stage 1 | 26.927438 |
| 68201 | 99995 | 22601 | 1 | 158 | 126.0 | 140 | 90 | 2 | 2 | 0 | 0 | 1 | 1 | 61 | 50.472681 | Hypertension Stage 2 | Hypertension Stage 2 | 50.472681 |
| 68202 | 99996 | 19066 | 2 | 183 | 105.0 | 180 | 90 | 3 | 1 | 0 | 1 | 0 | 1 | 52 | 31.353579 | Hypertension Stage 2 | Hypertension Stage 2 | 31.353579 |
| 68203 | 99998 | 22431 | 1 | 163 | 72.0 | 135 | 80 | 1 | 2 | 0 | 0 | 0 | 1 | 61 | 27.099251 | Hypertension Stage 1 | Hypertension Stage 1 | 27.099251 |
| 68204 | 99999 | 20540 | 1 | 170 | 72.0 | 120 | 80 | 2 | 1 | 0 | 0 | 1 | 0 | 56 | 24.913495 | Hypertension Stage 1 | Hypertension Stage 1 | 24.913495 |